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PEPTIDE AMINOACID SEQUENCES 
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N-TERMINUS 100KD PROTEIN 



Position 
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FIG.5 




GTCGACTTCCCGTCCTATTCGGCCTCGTCCGCTGAAGATCCATCCCACCA 
Sail 



TTGCACGTGGGCCACCTTTGTGAGCTTCTAACCTGAACTGGTAGAGTATC 100 



ACACACCATGCCAAGGTGGGATGAAGGGGTTATATGAGACCGTCCGGTCC 



GGCGCGATGGCCGTAGCTGCCACTCGCTGCTGTGCAAGAAATTACTTCTC 2 00 



ATAGGCATCATGGGCGTCTCTGCTGTTCTACTTCCTTTGTATCTCCTGTC 
translation start 



TGG GTATGCTAAGCACCACAATCAAAGTCTAATAAGGACCCTCCCTTCCG 
s tart< 



300 



AGGGCCCCTGAAGCTCGGACTGTGTGGGACTACTGATCGCTGACTATCTG 
— intron 



TGCAG AGTCACCTCCGGACTGGCAGTCCCCGCCTCGAGAAATCAATCCAG 
->end 



400 



TTGCGATACGGTCGATCAGGGGTATCAATGCTTCTCCGAGACTTCGCATC 



TTTGGGGTCAATACGCACCGTTCTTCTCTCTGGCAAACGAATCGGTCATC 500 



TCCCCTGAGGTGCCCGCCGGATGCAGAGTCACTTTCGCTCAGGTCCTCTC 



CCGTCATGGAGCGCGGTATCCGACCGACTCCAAGGGCAAGAAATACTCCG 600 



CTCTCATTGAGGAGATCCAGCAGAACGCGACCACCTTTGACGGAAAATAT 



GCCTTCCTGAAGACATACAACTACAGCTTGGGTGCAGATGACCTGACTCC 700 



CTTCGGAGAACAGGAGCTAGTCAACTCCGGCATCAAGTTCTACCAGCGGT 



ACG AATCGCTCACAAGGAACATCGTTCGATTCATCCGATCCTCTGGCTCC 8 0 0 



AGCCGCGTGATCGCCTCCGGCAAGAAATTCATCGAGGGCTTCCAGAGCAC 



CAAGCTGAAGGATCCTCGTGCCCAGCCCGGCCAATCGTCGCCCAAGATCG 9 00 
BamHI 

ACGTGGTCATTTCCGAGGCCAGCTCATCCAACAACACTCTCGACCCAGGC 



ACCTGCACTGTCTTCGAAGACAGCGAATTGGCCGATACCGTCGAAGCCAA 1000 



TTTCACCGCCACGTTCGTCCCCTCCATTCGTCAACGTCTGGAGAACGACC 

TGTCCGGTGTGACTCTCACAGACACAGAAGTGACCTACCTCATGGACATG 1100 

TGCTCCTTCGACACCATCTCCACCAGCACC GTCGAC ACCAAGCTGTCCCC 

Sail 

• • • 

CTTCTGTGACCTGTTCACCCATGACGAATGGATCAACTACGACTACCTCC 12 0 0 

AGTCCTTGAAAAAGTATTACGGCCATGGTGCAGGTAACCCGCTCGGCCCG 
ACCCAGGGCGTCGGCTACGCTAACGAGCTCATCGCCCGTCTGACCCACTC 13 00 
GCCTGTCCACGATGACACCAGTTCCAACCACACTTTGGACTCGAGCCCGG 
CTACCTTTCCGCTCAACTCTACTCTCTACGCGGACTTTTCGCATGACAAC 14 00 
GGC ATCATCTCCATTCTCTTTGCTTTAGGTCTGT AC AACGGC ACT AAGCC 
GCTATCTACCACGACCGTGGAGAATATCACCCAGACAGATGGATTCTCGT 150 0 
CTGCTTGGACGGTTCCGTTTGCTTCGCGTTTGTACGTCGAGATGATGCAG 

TGTCAGGCGGAGCAGGAGCCGCTGGTCCGTGTCTTGGTTAATGATCGCGT 1600 

m . • • • 

TGTCCCGCTGCATGGGTGTCCGGTTGATGCTTTGGGGAGATGTACCCGGG 

ATAGCTTTGTGAGGGGGTTGAGCTTTGCTAGATCTGGGGGTGATTGGGCG 1700 

GAGTGTTTTGCTTAGCTGAATTACCTTGATGAATGGTATGTATCACATTG 
translation stop 

CATATCATTAGCACTTCAGGTATGTATTATCGAAGATGTATATCGAAAGG 18 00 

ATCAATGGTGACTGTCACTGGTTATCTG7LATATCCCTCTATACCTCGTCC 
- 

CACAACCAATCATCACCCTTTAAACAATCACACTCAACGCACAGCGTACA 19 00 
AACG7VACAAACGCACAAAGAATATTTTACACTCCTCCCCAACGCAATACC 
AACCGCAATTCATCATACCTCATATAAATACAATACAATACAATACATCC 2000 

FIG.6B 



ATCCCTACCCTCAAGTCCACGCATCCTATAATCAATCCCTACTTACTTAC 
TTCTCCCCCTCCCCCTCACCCTTCCCAGAACTCACCCCCGAAGTAGTAAT 2100 
AGTAGTAGTAGAAGAAGCAGACGACCTCTCCACCAATCTCTTCGGCCTCT 

TATCCCCATACGCTACACAAAACCCCCACCCCGTTAGCATGCACTCAGAA 2 2 00 

• * ■ * 

AATAATCAAAAATAACTAAGAAGGAAAAAAAAGAAGAAGAAAGGTTACAT 

ACTCCTCTCATACAAACTCCAAGACGTATACATCAAGATGGGCAATCCCA 2 3 00 

CCATTACTGATATCCATCTATGAACCCATTCCCATCCCACGTTAGTTGAT 

TACTTTACTTAGAAG AAGAAAAAGGG AAGGGAAGGGAAAGAAGTGGATGG 2 4 0 0 

GATTGAGTTAGTGCTCACCGTCTCGCAGCAAGTTTATATTCTTTTGTTTG 
- 

GCGG ATATCTTTCACTGCTCCTGCTGGACGTTGTCACGGGGTGGTAGTGG 2 5 00 
TTGGCGGTGGTGAGGGTCCATGATCACTCTTGGTTTGGGGGGTTGTTGTT 
GTCGTTGTTGTTGTTGTTGGGTGGGCATTTTCTTTTCTTCACTTGGGGAT 2 60 0 
TATTATTTGGAATTGGTTAGTTTGAGTGAGTGGGTAATATTGAATGGGTG 
ATTATTGGG AATG AAGTAGATTTGGCTATGAATGGTTGATGGG ATGGAAT 27 00 
GAATGGATGGATGAATAGATGGAGGCGGAAAAGTCAGGTGGTTTGAGGTT 
CGG ATTATTATCTTTGTGCCTG AGGCATCACTCTCCATCTATGTTGTTCT 2 8 0 0 
TTCTATACCGATCTACCAGAGCTAAGTTGACTGATTCTACCACAGTGCAC 
AATAAGTATGTACTTATTTCATTTAGAGTATTTAG ATTAACCCGCTGTGC '2900 
TATTTGCCGTAGCTTTCCACCCAATTTCGAAGTTCGAAGAATTAAAACTC 
ATCCTACAGTACAGAATAGAAGTAAAAGGAGAAGAGAAAAACAAGATAAT 3 000 

FIG.6C 



ACAACCAGTCCAGGTCCATTCTAGATCTCGAATGACCACCAAATAAGAAA 
GCAACAAGCAAGTAAGCAAAGCATAAGTCTAAATGAACGCCAATAACTTC 310 0 

ATCGCCTGCCTTTGAAACTGAACGCTATGCACGAATGGCTCGAAATGATT 

• • • » • 

CCCTTAACTCCGTAGTATTG AG AGTGAGAGGAAAAG AAAAAAAG AG ACAG 3 2 0 0 

• « • * • 

AAAAGCTGACCATGGGAAAGAAGCATGATCAGTCGGGAATGGATCTGCGG 

GTTGAGATAGATATGAGTTGCCTCGCAGATCCGGTGACAAGATAAGAGAA 3 3 00 
TTGGGAGATGTGATCAGCCACTGTAACTTCATCAAGCATCGACATTCAAC 
GGTCGGGTCTGCGGGTTG AGATGCAAGTTGAGATGCCACGCAGACCCGAA 3 4 0 0 

CAGAGTGAGAGATGTGAGACTTTTGAACCACTGTGACTTCATCAAGCATC 

■ • • • • 

AAAACACACTCCATGGTCAATCGGTTAGGGTGTGAGGGTTGATATGCCAG 3 5 00 

GTTCGATGCCACGCAGACCCGAACCGACTGAGAAATATGAAAAGTTGGAC 

m • • • • 

AGCCACTTCATCTTCATCAAGCGTAAAACCCCAATCAATGGTAAATCGAA 3 6 00 

• * • • • 

AACGAATCTGCGGGCTGATGTGGAAATGAGACGAATGCCTCGCAGATTCG 

AAGACACGTAAATCGAGATGAACAATCACTTTAACTTCATCAAAGCCTTA 3 7 00 

• • • * • 

AATCACCCAATGGCCAGTCTATTCGGGTCTGCGGGTTGAGGTTCCTGTTG 

w a • * ■ 

AGATGCCACGCAGACTGCGAACATGCGATGCATTATAAGTTGGACGAGTG 3 8 00 

« • • • * 

TAGACTGACCATTGATAACCGAGATAAACAATCACTTCAACTTCATCAAA 

GCCTTAAATCACTCAATGGCCAGTCTGTTTGCGGTCTGCGGGCTGATACC 3 9 00 

• ■ * • * 
CAAGTTGCGATGCCACGCAGACTGCAAACATTGATCGAGAGACGAGAAAA 

ACAACGCACTTTAACTTCAACAAAAGCCTTTCAATCAGTCAATGGCCAGT 4 000 

FIG.6D 




CGAACATGCGATGTAATTTCTTAGTTAGACGAGTGCCTGGCCATTGAGAA 4 100 

ACGAGAGAAACAACCACTTTAACTTCATGAAAGCCTTGAACTACTCAATG 
- 

ACCCGTCTGTTGGCGGTCTGCGGGCTGATATTCGAGTTGAGATGCCACGC 4 2 00 
AGACCGCCAAGATGCGATGTATCATGTAAGTTAGATGAGTGACTGGCCAT 
TGAGAAACGAGAGAAACAACCACACTTCATGAGAGCCTTAAATTATTCAA 4 3 0 0 

TGACCAGTCTGTTCACGGTCTGCGGGTTGGTATGCGAGTCGAGGTGCCTC 

• • ■ * 

GCAGACCGCGAACATGCGATGTTTTCGATGGACGAGTGAAGCCTGACGAT 4 4 00 

CGAGAACTATCTCAGTTGGGTTGGCCATTCGGCTGGCCGTTGGGTTTAGT 

ATTAGGATCGTCAGGTTTGTCCGATGGAACGTTCCGTTTGCGTGCGTTGG 4 500 
CGCGACGAGCCCTCTCCTCGGCGTGATTCTGAAATTCTGCAATCAGGGCA 

GCCGCAGCACGGCGACGGG ACGTCCTCCAGGAGCTGTGTTG AAGTTTCGG 4 6 00 
GGTGGCGGTCCAGAAGGGGGAGTTACATTAAAAGCCTCATAGATGTCTTT 

GGGTGGTTCCGGGGGGCCCATCGCAAGATCTTCTGGAGTTGTGCGTCTGA 4 7 00 
TCATCTCTTGAGTGTAATTGCGACGCAGACCGAGCTTCAGGATTTTGGAA 

GGGCTGGATCGCTCCTGCTGACTCTTTCCCTCAGCGGGCTTCGTCTCGGC 4 8 00 
AGTCTTCATTTCGGCGGGCTGATCTTCCATCTCAGAATGGGATCGCTTTC 

TGGTCGCTGCACCCGCTCCTCCCTTCAAGGTCAGCTTGATGCGCAGCGTC 4 9 00 
TTGGGCGGCTCAGCTGGTGGAGTTGGTTCCGGCTCTGGCTCCCTCCGGCG 

TCGCTTGGGCACTTGAGTAGTCTCTGAGGCTTCGCCGCGGCGCCGTTTGC 5000 

CTIO OCT 



_;i.im::« 7 m :f o «i „,..i. .1,0 gas 
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GAGTCGGCTCCTTGGTCTCTTTGGCCTCTTTCACTTCACCTGGACCGTCT 
TTCGGGGCGGTTTCATCGTGCTGAGCGATCAAGGTTTGGATGTAGGCAGC 510 0 
CGGCATCATTCGATCAACGGCAATTCCTCTCTTGCGGGCCTCCTCCCGAG 
CCTTGATTGTCGCCTTGACCTCGTCCACGTTTTCGAAGAAGAAAGGCATC 52 00 
TTGTTATCCTGAGGCAAGTTGCGCTCTCCCATGCGTGGGGATATCCGAAG 
ATGCGGTCCTTCTCGAACTGTTCATGAGACTTCAGACGAATTGG AGGCTG 5 3 0 0 
GGGGAGCAATTTGTCTCCGTAGGTGTTGTTAGGGCGGAACCAAGAATAGC 
CTTCGCCTACAACGACAAGCTCTTCGCCAAATTTATTTTTTTGGCCTGTA 54 00 
AAAACGAACCCATCCTCGTCAGTCCACCGGTGCGTCTCGGACGTAGAGAT 
TGGCTTACTTATTCCCTCAACGCCGATCTCTGCCTGGGGCTGCGCTTCGG 5500 
ATGCGGCCTCGGTCACGGCTCCGCCTCGGACTGCACCGCTGGAGTTTCGG 
TCTTCTTCTCCTGCTTCTCCAGGTACTCCTTGCGTAACTCTTCGATCAGC 5600 
CTCGGCTTCCGATGACTGCTCAAATTCTGGAGCAACAGCTG CCGCGGCCA 
GGTCAAGCAGGCGGTTTGCTAAAACTGCCCATTTTCCATCGACACCTGCC 57 00 
TCCGACGCCTGTGCAAAACCAGCTGTTTTCGCATTGGCCTGTTTGTTGGC 
ACGCGTCTTCTTGACTGCTGCCTTGCCCTTTACTTCCTTGAGAGCAGACT 5800 
CTGGCTTAGATGATGGTGCACGGTTTCTGCGG7VAGCGCCGCTCAGATTCC 

AAAGATTCCATAGCTTTAATGGTAGGCTTTCTGGTTCTTCCAGAAGTGCG 59 00 

• • • • • 

CGCAGCTGACGTAGTGGTTGAGTAGCTGGCAGTTGGGGATCCTGGGCCCT 

1 

CATTGG AACCATCAAGACCAAATTTGTTTCCATACATATCAGCATGGTAT 60 0 0 

FIG.6F 
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TCAAAAGGAAAACTTTCGCCGTACGGAGTACTGCGTTCGATTCCGGGTGT 
ATCCAAGTCGTATCCAGACATGGTGTCGAATTCAGCCTTGCTGTCAAGAG 610 0 
CAGGGGTACTTTCAATGCTGTCAGCAACCACGCGGCCAAAGGGCGTCTTC 
GGGAAAG AAGGTGTTTCAAG AG AAGCGTCATCCACGGCCTGGCTTGCGGC 6 2 0 0 

GTTGATTGCAGACTTTCGAGTAGATCGCTGAGGTCGCGAACTGGTTCGAG 

• • • • • 

TAGCAACCTGTGAATTGGCAGCCTTGTGACTGCTTCGATTCACTGCAGAG 6 3 00 

• • • • ■ 

ACGGAGTAGACTGCACTGATTTGGAATTCTGAGTCGCAGCCATTCTGGAT 

. • • • * 

TTGCGTTCGGCGCGACGAGATCTCGCAGTCGTGGTACGAGGAGTAGAGCG 6 4 00 

• • • • • 

AGGCTGCGTAGCAGTGTTGCAAGCTTGGTGCTAGCCTCCTGGGCTTCAGC 

• • • ■ • • 
AGCTTCAGCAGTGGTGGCAGACGCAGCAGAATTAGCGGAGCTTTATCGGC 6 5 0 0 

TTTGCCGCTCTGAGCGTTGGGAGTAGAAGTGAGAGAAGAGGTAGAGTCCA 

• * • • • 
CGGAAGAAGTCTTCTCGCTGTTCTCAAAGCCGTTCAGCTTTGCTGGCATA 6 6 0 0 

• • • - • 

GACTTACGCGTCTTGCGGCTGTTGGAAGCGGAAGAGTTCATGGCGGGAGA 

■ • • • • 

GG AG ACGTTAG AAGTAG ACATGGTGGGGTTTGTTG ACGGGTTTTG AGTAA 67 0 0* 

« • • • • 

CAAGAGACTTGCGTCGATCTTTGAGTGTTCTTGACAGAAAGTTATGCAAC 



GTCGAC 6756 
Sail 



FIG.6G 
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ATGGGCGTCTCTGCTGTTCTACTTCCTTTGTATCTCCTGTCTGGAGTCAC 
MGVSAVLLPLYLLSGVT 
-23 -20 ' -10 



CTCCGGACTGGCAGTCCCCGCCTCGAGAAATCAATCCAGTTGCGATACGG 
SGLAVPA SRNQS SCDT 
-1 +1 ' 10 



100 



TCGATCAGGGGTATCAATGCTTCTCCGAGACTTCGCATCTTTGGGGTCAA 
VDQGYQCFS ETSHLWGQ 
' 2 0 ' 



TACGCACCGTTCTTCTCTCTGGCAAACGAATCGGTCATCTCCCCTGAGGT 
YAPFFSIjANESV ISPEV 
30 ' 40 



200 



GCCCGCCGGATGCAGAGTCACTTTCGCTCAGGTCCTCTCCCGTCATGGAG 
PAGCRVTFAQVL SRHG 
'50 '60 



CGCGGTATCCGACCGACTCCAAGGGCAAGAAATACTCCGCTCTCATTGAG 
ARYPTDSKGKKYSALIE 
/ 70 ' 



300 



GAGATCCAGCAGAACGCGACCACCTTTGACGGAAAATATGCCTTCCTGAA 
EIQQNATTFDGKYAFLK 
80 ' 90 



GACATACAACTACAGCTTGGGTGCAGATGACCTGACTCCCTTCGGAGAAC 
TYNYSLGADDLTFFGE 
' 100 ' no 



400 



AGGAGCTAGTCAACTCCGGCATCAAGTTCTACCAGCGGTACGAATCGCTC 
QELVNSGIKFYQRYESL 
' 120 ' 



ACAAGGAACATCGTTCCATTCATCCGATCCTCTGGCTCCAGCCGCGTGAT 
TRNIVPFIRSSGSSRVI 
130 ' 140 



500 



CGCCTCCGGCAAGAAATTCATCGAGGGCTTCCAGAGCACCAAGCTGAAGG 
ASGKKFIEGFQSTKLK 
' 150 ' 160 



ATCCTCGTGCCCAGCCCGGCCAATCGTCGCCCAAGATCGACGTGGTCATT 
DPRAQPGQSSPKIDVVI 
' 170 ' 



600 



TCCGAGGCCAGCTCATCCAACAACACTCTCGACCCAGGCACCTGCACTGT 
SEAS S SNNTLDP GTCTV 
180 ' 190 



CTTCGAAGACAGCGAATTGGCCGATACCGTCGAAGCCAATTTCACCGCCA 
FEDSELADTVEANFTA 
' 200 ' 210 



700 



FIG.8A 




CGTTCGTCCCCTCCATTCGTCAACGTCTGGAGAACGACCTGTCCGGTGTG 
T F V P S I R QRli END L S GV 

220 ' 



ACTCTCACAGACACAGAAGTGACCTACCTCATGGACATGTGCTCCTTCGA 8 00 

TLTDTEVTYLMDMCSFD 
230 ' 240 

CACCATCTCCACCAGCACCGTCGACACCAAGCTGTCCCCCTTCTGTGACC 
TI.STSTVDTKLSPFCD 
t 250 ' 260 

TGTTC ACCCATG ACG AATGG ATCAACTACGACTACCTCCAGTCCTTG AAA 9 0 0 

LFTHDEWINYDYLQSLK 
' 270 



AAGTATTACGGCCATGGTGCAGGTAACCCGCTCGGCCCGACCCAGGGCGT 
KYYGHGAGNPLGPTQGV 
280 ' 290 

CGGCTACGCTAACGAGCTCATCGCCCGTCTGACCCACTCGCCTGTCCACG 
GYANELIARLTHSPVH 
' 300 ' 310 



1000 



ATGACACCAGTTCCAACCACACTTTGGACTCGAGCCCGGCTACCTTTCCG 
DDTSSNHTLDSSPATFP 
' 320 ' 

CTCAACTCTACTCTCTACGCGGACTTTTCGCATGACAACGGCATCATCTC 110 0 
LNSTLYADFSHDNGIIS 
330 ' 340 

CATTCTCTTTGCTTTAGGTCTGTACAACGGCACTAAGCCGCTATCTACCA 
ILFALGLYNGTKPLST 
' 350 ' 360 

CGACCGTGGAG AATATCACCCAGACAGATGGATTCTCGTCTGCTTGGACG 12 00 
TTVEHTTQTDGPSSAWT 
' 370 ' 



GTTCCGTTTGCTTCGCGTTTGTACGTCGAGATGATGCAGTGTCAGGCGGA 
VPFASRLYVEMMQCQAE 
38 0 ' 39 0 

GCAGGAGCCGCTGGTCCGTGTCTTGGTTAATGATCGCGTTGTCCCGCTGC 
QEPLVRVLVNDRVVPL 
' 400 ' 410 



1300 



ATGGGTGTCCGGTTGATGCTTTGGGGAGATGTACCCGGGATAGCTTTGTG 
HGCPVDALGRCTRDS FV 
* 420 ' 



AGGGGGTTGAGCTTTGCTAGATCTGGGGGTGATTGGGCGGAGTGTTTTGC 
RGLSFARSGGDWAECFA 
430 ' 440 



1400 



TTAG 
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AG/PHYTASE GENE FUSIONS BY PCR 
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